Mathematical morphology

Mathematical morphology (MM) is a theory and technique for the analysis and processing of geometrical structures, based on set theory, lattice theory, topology, and random functions. MM is most commonly applied to digital images, but it can be employed as well on graphs, surface meshes, solids, and many other spatial structures.

Topological and geometrical continuous-space concepts such as size, shape, convexity, connectivity, and geodesic distance, were introduced my by MM on both continuous and discrete spaces. MM is also the foundation of morphological image processing, which consists of a set of operators that transform images according to the above characterizations.

MM was originally developed for binary images, and was later extended to grayscale functions and images. The subsequent generalization to complete lattices is widely accepted today as MM's theoretical foundation.

Contents

History

Mathematical Morphology was born in 1964 from the collaborative work of Georges Matheron and Jean Serra, at the École des Mines de Paris, France. Matheron supervised the PhD thesis of Serra, devoted to the quantification of mineral characteristics from thin cross sections, and this work resulted in a novel practical approach, as well as theoretical advancements in integral geometry and topology.

In 1968, the Centre de Morphologie Mathématique was founded by the École des Mines de Paris in Fontainebleau, France, led by Matheron and Serra.

During the rest of the 1960s and most of the 1970s, MM dealt essentially with binary images, treated as sets, and generated a large number of binary operators and techniques: Hit-or-miss transform, dilation, erosion, opening, closing, granulometry, thinning, skeletonization, ultimate erosion, conditional bisector, and others. A random approach was also developed, based on novel image models. Most of the work in that period was developed in Fontainebleau.

From mid-1970s to mid-1980s, MM was generalized to grayscale functions and images as well. Besides extending the main concepts (such as dilation, erosion, etc...) to functions, this generalization yielded new operators, such as morphological gradients, top-hat transform and the Watershed (MM's main segmentation approach).

In the 1980s and 1990s, MM gained a wider recognition, as research centers in several countries began to adopt and investigate the method. MM started to be applied to a large number of imaging problems and applications.

In 1986, Jean Serra further generalized MM, this time to a theoretical framework based on complete lattices. This generalization brought flexibility to the theory, enabling its application to a much larger number of structures, including color images, video, graphs, meshes, etc. At the same time, Matheron and Serra also formulated a theory for morphological filtering, based on the new lattice framework.

The 1990s and 2000s also saw further theoretical advancements, including the concepts of connections and levelings.

In 1993, the first International Symposium on Mathematical Morphology (ISMM) took place in Barcelona, Spain. Since then, ISMMs are organized every 2–3 years, each time in a different part of the world: Fontainebleau, France (1994); Atlanta, USA (1996); Amsterdam, Netherlands (1998); Palo Alto, CA, USA (2000); Sydney, Australia (2002); Paris, France (2004); Rio de Janeiro, Brazil (2007); Groningen, Netherlands (2009); and Intra (Verbania), Italy (2011).

References

Binary morphology

In binary morphology, an image is viewed as a subset of an Euclidean space \mathbb{R}^d or the integer grid \mathbb{Z}^d, for some dimension d.

Structuring element

The basic idea in binary morphology is to probe an image with a simple, pre-defined shape, drawing conclusions on how this shape fits or misses the shapes in the image. This simple "probe" is called structuring element, and is itself a binary image (i.e., a subset of the space or grid).

Here are some examples of widely used structuring elements (denoted by B):

Basic operators

The basic operations are shift-invariant (translation invariant) operators strongly related to Minkowski addition.

Let E be a Euclidean space or an integer grid, and A a binary image in E.

Erosion

The erosion of the binary image A by the structuring element B is defined by:

A \ominus B = \{z\in E | B_{z} \subseteq A\},

where Bz is the translation of B by the vector z, i.e., B_z = \{b%2Bz|b\in B\}, \forall z\in E.

When the structuring element B has a center (e.g., B is a disk or a square), and this center is located on the origin of E, then the erosion of A by B can be understood as the locus of points reached by the center of B when B moves inside A. For example, the erosion of a square of side 10, centered at the origin, by a disc of radius 2, also centered at the origin, is a square of side 6 centered at the origin.

The erosion of A by B is also given by the expression: A  \ominus B = \bigcap_{b\in B} A_{-b}.

Example application: Assume we have received a fax of a dark photocopy. Everything looks like it was written with a pen that is bleeding. Erosion process will allow thicker lines to get skinny and detect the hole inside the letter "o".

Dilation

The dilation of A by the structuring element B is defined by:

A  \oplus B = \bigcup_{b\in B} A_b.

The dilation is commutative, also given by: A  \oplus B = B\oplus A = \bigcup_{a\in A} B_a.

If B has a center on the origin, as before, then the dilation of A by B can be understood as the locus of the points covered by B when the center of B moves inside A. In the above example, the dilation of the square of side 10 by the disk of radius 2 is a square of side 14, with rounded corners, centered at the origin. The radius of the rounded corners is 2.

The dilation can also be obtained by: A  \oplus B = \{z \in E| (B^{s})_{z} \cap A\neq \varnothing\}, where Bs denotes the symmetric of B, that is, B^{s}=\{x\in E | -x \in B\}.

Example application: Dilation is the opposite of the erosion. Figures that are very lightly drawn get thick when "dilated". Easiest way to describe it is to imagine the same fax/text is written with a thicker pen.

Opening

The opening of A by B is obtained by the erosion of A by B, followed by dilation of the resulting image by B:

A \circ B  = (A \ominus B) \oplus B .

The opening is also given by A \circ B = \bigcup_{B_x\subseteq A} B_x, which means that it is the locus of translations of the structuring element B inside the image A. In the case of the square of side 10, and a disc of radius 2 as the structuring element, the opening is a square of side 10 with rounded corners, where the corner radius is 2.

Example application: Let's assume someone has written a note on a non-soaking paper that writing looks like it is growing tiny hairy roots all over. Opening essentially removes the outer tiny "hairline" leaks and restores the text. The side effect is that it rounds off things. The sharp edges start to disappear.

Closing

The closing of A by B is obtained by the dilation of A by B, followed by erosion of the resulting structure by B:

A \bullet B  = (A \oplus B) \ominus B .

The closing can also be obtained by A \bullet B = (A^{c} \circ B^{s})^{c}, where Xc denotes the complement of X relative to E (that is, X^{c}=\{x\in E | x\not \in X\}). The above means that the closing is the complement of the locus of translations of the symmetric of the structuring element outside the image A.

Properties of the basic operators

Here are some properties of the basic binary morphological operators (dilation, erosion, opening and closing):

Other operators and tools

Grayscale morphology

In grayscale morphology, images are functions mapping a Euclidean space or grid E into \mathbb{R}\cup\{\infty,-\infty\}, where \mathbb{R} is the set of reals, \infty is an element larger than any real number, and -\infty is an element smaller than any real number.

Grayscale structuring elements are also functions of the same format, called "structuring functions".

Denoting an image by f(x) and the structuring function by b(x), the grayscale dilation of f by b is given by

(f\oplus b)(x)=\sup_{y\in E}[f(y)%2Bb(x-y)],

where "sup" denotes the supremum.

Similarly, the erosion of f by b is given by

(f\ominus b)(x)=\inf_{y\in E}[f(y)-b(y-x)],

where "inf" denotes the infimum.

Just like in binary morphology, the opening and closing are given respectively by

f\circ b=(f\ominus b)\oplus b, and
f\bullet b=(f\oplus b)\ominus b.

Flat structuring functions

It is common to use flat structuring elements in morphological applications. Flat structuring functions are functions b(x) in the form

b(x)=\left\{\begin{array}{ll}0,&x\in B,\\-\infty,&\mbox{otherwise}\end{array}\right.,

where B\subseteq E.

In this case, the dilation and erosion are greatly simplified, and given respectively by

(f\oplus b)(x)=\sup_{z\in B^{s}}f(x%2Bz), and
(f\ominus b)(x)=\inf_{z\in B}f(x%2Bz).

In the bounded, discrete case (E is a grid and B is bounded), the supremum and infimum operators can be replaced by the maximum and minimum. Thus, dilation and erosion are particular cases of order statistics filters, with dilation returning the maximum value within a moving window (the symmetric of the structuring function support B), and the erosion returning the minimum value within the moving window B.

In the case of flat structuring element, the morphological operators depend only on the relative ordering of pixel values, regardless their numerical values, and therefore are especially suited to the processing of binary images and grayscale images whose light transfer function is not known.

Other operators and tools

By combining these operators one can obtain algorithms for many image processing tasks, such as feature detection, image segmentation, image sharpening, image filtering, and classification.

Mathematical morphology on complete lattices

Complete lattices are partially ordered sets, where every subset has an infimum and a supremum. In particular, it contains a least element and a greatest element (also denoted "universe").

Adjunctions (Dilation and Erosion)

Let (L,\leq) be a complete lattice, with infimum and supremum symbolized by \wedge and \vee, respectively. Its universe and least element are symbolized by U and \emptyset, respectively. Moreover, let \{ X_{i} \} be a collection of elements from L.

A dilation is any operator \delta\colon L\rightarrow L that distributes over the supremum, and preserves the least element. I.e.:

An erosion is any operator \varepsilon\colon L\rightarrow L that distributes over the infimum, and preserves the universe. I.e.:

Dilations and erosions form Galois connections. That is, for all dilation \delta there is one and only one erosion \varepsilon that satisfies

X\leq \varepsilon(Y)\Leftrightarrow \delta(X)\leq Y

for all X,Y\in L.

Similarly, for all erosion there is one and only one dilation satisfying the above connection.

Furthermore, if two operators satisfy the connection, then \delta must be a dilation, and \varepsilon an erosion.

Pairs of erosions and dilations satisfying the above connection are called "adjunctions", and the erosion is said to be the adjoint erosion of the dilation, and vice-versa.

Opening and Closing

For all adjunction (\varepsilon,\delta), the morphological opening \gamma\colon L\rightarrow L and morphological closing \phi\colon L\rightarrow L are defined as follows:

\gamma = \delta\varepsilon, and
\phi = \varepsilon\delta.

The morphological opening and closing are particular cases of algebraic opening (or simply opening) and algebraic closing (or simply closing). Algebraic openings are operators in L that are idempotent, increasing, and anti-extensive. Algebraic closings are operators in L that are idempotent, increasing, and extensive.

Particular cases

Binary morphology is a particular case of lattice morphology, where L is the power set of E (Euclidean space or grid), that is, L is the set of all subsets of E, and \leq is the set inclusion. In this case, the infimum is set intersection, and the supremum is set union.

Similarly, grayscale morphology is another particular case, where L is the set of functions mapping E into \mathbb{R}\cup\{\infty,-\infty\}, and \leq, \vee, and \wedge, are the point-wise order, supremum, and infimum, respectively. That is, is f and g are functions in L, then f\leq g if and only if f(x)\leq g(x),\forall x\in E; the infimum f\wedge g is given by (f\wedge g)(x)=f(x)\wedge g(x); and the supremum f\vee g is given by (f\vee g)(x)=f(x)\vee g(x).

See also

References

External links